That's why, earlier this week, I introduced Malark-O-Meter to the world (well...more like only 500 people in the world). I statistically analyzed fact checker report cards from Truth-O-Meter and The Fact Checker to compare the factuality of the 2012 presidential and vice presidential candidates overall, and during the debates that had happened so far. I promised I'd get back to you on the third debate, and with a summary of how the two parties did in the debates overall, compared to one another, and to their usual selves.
Unless Bidama and Rymney (or is it Obiden and Romyan?) blast each other in the next few weeks as much as they have in the last year, this is probably my final 2012 election malarkey analysis until Election Day. This is the one of the most comprehensive, sophisticated, and detailed analyses of the 2012 presidential candidates' factuality.
Share it with your friends. Discuss its results. Debate its merits. Tell me precisely why you think I'm full of shit. Supersize the histograms and past them to brick walls like you're Shepard Fairey. Because this stuff matters. It matters because the facts matter. It matters because we should understand how confident we can be in our judgments about people.
Enough histrionics. Let's get to the science. If you've never been here before, quickly skim how I calculate the malarkey score and how I do my statistical comparisons before continuing. If you read my last 2012 presidential campaign update, not much has changed. So you might want to scroll down to my analysis of the third debate, and the debates as a whole.
Full report cards
I collected the full Truth-O-Meter and Fact Checker report cards for Obama, Biden, Romney, and Ryan this morning. Let's start with what we observe. That is, what can we say about the factuality of the two sides if we take the report cards at face value? Here are the revised overall malarkey scores for each individual candidate and each ticket.
Individuals
| Ticketscollated |
ticket | malarkey |
---|---|
Obama/Biden | 46 |
Romney/Ryan | 56 |
average
(average malarkey of individuals)
ticket | malarkey |
---|---|
Obama/Biden | 48 |
Romney/Ryan | 57 |
The trouble is, for each candidate (and party), we only have a small sample of the statements they've made. That introduces sampling error. We must calculate the certainty with which we can make judgments about the candidates and parties given the data we have.
How certain can we be that the candidates are much better or worse than half full of malarkey? From the probability distributions shown at right, I calculated the probabilities.
Odds are 9 to 1 that Obama's less than half full of malarkey, but not by much. It's almost 100% likely the Romney is more than half full of malarkey. The difference is greater than for Obama, but still not much difference than a half buck of malarkey.
The odds are only around 2 to 1 that Biden is more than half full of malarkey. Again, not by much. The same is true for Ryan.
To the left are the probability distributions of the collated and average malarkey scores. Based on these distributions, the odds are nearly 9 to 1 that the Obiden's collective statements are on average less than half full of malarkey (but not by much), while it is almost certain that Rymney's are more than half full of malarkey (by a wider margin, but still not by much).
It's a statistical toss-up whether Obama and Biden are on average less than half full of malarkey, whereas the odds are over 19 to 1 that Romney and Ryan are on average more than half full of malarkey (but still not by much).
To the right are probability distributions of the ratio between a Republican malarkey score and a Democrat malarkey score. Red bars occur when the Republican malarkey score is greater than the Democrat score. The comparisons run from very murky (for the v.p. candidates) to pretty clear (for the collated ticket report cards and the presidential candidates).
I am essentially 100% certain that Romney spews more malarkey than Obama...but not much more. Not even twice as much. Not even one and a half times as much. I'm also nearly 100% certain that Obiden collectively spew less malarkey than Rymney. But again, they're not that different. Basically, I can't tell a difference between Biden and Ryan. If there is a difference between them, it is tiny, but in favor of Biden. I can, however, give 9 to 1 odds that Obama and Biden are on average more factual than Romney and Ryan. But not that much more factual!
We draw two lessons. First, and I repeat from last time, the differences in factuality between the two parties aren't as large as either side would have you believe. That said, there is a clear difference. The differences we can be certain about favor Democrats. And all of the differences we've measured, regardless of our certainty in them, suggest that the Democratic candidates are more factual than the Republicans.
On to the debates.
Debates
opponent | malarkey |
---|---|
Romney | 57 |
Obama | 46 |
Once again, Romney spews more malarkey than Obama. At least, that's what the data says. But how strong is the evidence? Let's bust out the simulator.
First, I collated the candidates' report cards from each debate, then calculates a malarkey score from it. This measures the average falsehood of the statements a candidate made across all the debates. The odds are about 2 to 1 that Obama's statements during the debates were less than half full of malarkey. The odds are about the same that Romney's statements were more than half full of malarkey.
Second, I averaged the candidate's malarkey score across the three debates. The odds are again about 2 to 1 that Obama was on average less than half full of malarkey during a debate. Statistically, we can't tell whether Romney was on average more or less than half full of malarkey during a debate.
Maybe. In any case, the odds are 4 to 1 he spewed more a few more falsehoods during the first debate than he usually does. The odds are again 4 to 1 that his performance improved during the second debate, when he appeared to be more factual than usual. Statistically, we can't tell a difference between the usual Obama and the Obama in the final debate.
As for the third debate, the odds are only 2 to 1 that Romney was more factual in the final debate than he usually is. Whatever the case, he seems to have lost that one, too.
So much for the persuasive power of facts!
The odds are about 3 to 2 that Biden was less than half full of malarkey during the debate. Contrastingly, the odds are nearly 9 to 1 that Ryan was more than half full of malarkey during the debate. So Biden was probably right. It was all just a bunch of stuff! Well, not all of it. Actually, not much more than half of it was malarkey.
Still, given the small amount of evidence we have from the debate (which introduces a lot of sampling error), it's quite interesting that the odds remain so high that Ryan spewed so much malarkey. Perhaps it was Biden's mastering of the facts after all that dampened the Republican's Romentum! Well, at least it was his ability to point out Ryan's factual missteps. But remember, Biden was about half full of malarkey during the debate, too.
I'd like to think that Ryan's subtle cues of his own falsehood were a letdown to some undecided voters who had expected more from him, but the polls were pretty split about who won the debate.
I came up with two measures of malarkey overall all four debates. First, I simply collated the statements from the presidential and vice presidential candidates into two summary report cards for each party.
Second, I took the average of a presidential candidate's average malarkey across all three presidential debates, and the vice president's debate malarkey score. Let's unpack that a bit. Step one was to average the presidential candidate's malarkey across all three debates. Step two was to take the average of that average and the vice president's malarky score from the vice presidential debate.
Why did I take an average of averages? Because I wanted to measure the average malarky score of an individual on a party's ticket, not the party's average score across the four debates. If I'd done the latter, I would have weighted presidential candidates more heavily, which I already do in the collated measure since presidential candidates were more heavily fact checked, and had more debates, than vice presidential candidates.
Okay, let's look at some graphs and calculate some odds.
The odds are almost 3 to 1 that Obiden's collective statements during the debates were less than half full of malarkey. The odds are about 2 to 1 that the average Democratic candidate's average debate performance was more than half factual.
Contrastingly, the odds are better than 6 to 1 that the collective statements of Rymney were more than half full of malarkey. The case is similar for the average Republican candidate's average debate performance.
Not the similarities between the confidence intervals and means of the debate summaries and those of each party's malarkey scores calculated from their full report cards. These similarities make me confident that malarkey scores taken from full report cards are a pretty good predictor malarkey scores accrued during events like televised debates. Remember also that the candidates' overall malarkey scores were calculated from two fact checkers, whereas the debate data comes from just one.
Maybe there is something to this Malark-O-Meter thing after all. Which brings me to our final plot.
The odds are better than 6 to 1 that the average Republican candidate's average debate performance was more full of malarkey than the average average Democratic candidate's. Again, the difference isn't big, but it's not trivial.
Of course, you don't only care about factuality. You care about policy. You care about issues. But therein lies the rub. When politicians design and advocate for policies, they ideally do so with some grounding in the facts. Evidence matters, or at least it should, just as much to policymaking as it does in a courtroom or a chemistry lab.
What about values? You should care about your candidates' values too, right? But how are your candidates' values informed by the facts?
You see where I'm going here. I understand that factuality isn't the only characteristic we should consider when deciding who gets our vote.
But it sure seems to be at the root of all the others!
VOTE!